Goto

Collaborating Authors

 tag prediction


Uchaguzi-2022: A Dataset of Citizen Reports on the 2022 Kenyan Election

arXiv.org Artificial Intelligence

Online reporting platforms have enabled citizens around the world to collectively share their opinions and report in real time on events impacting their local communities. Systematically organizing (e.g., categorizing by attributes) and geotagging large amounts of crowdsourced information is crucial to ensuring that accurate and meaningful insights can be drawn from this data and used by policy makers to bring about positive change. These tasks, however, typically require extensive manual annotation efforts. In this paper we present Uchaguzi-2022, a dataset of 14k categorized and geotagged citizen reports related to the 2022 Kenyan General Election containing mentions of election-related issues such as official misconduct, vote count irregularities, and acts of violence. We use this dataset to investigate whether language models can assist in scalably categorizing and geotagging reports, thus highlighting its potential application in the AI for Social Good space.


Using Posters to Recommend Anime and Mangas in a Cold-Start Scenario

arXiv.org Machine Learning

Item cold-start is a classical issue in recommender systems that affects anime and manga recommendations as well. This problem can be framed as follows: how to predict whether a user will like a manga that received few ratings from the community? Content-based techniques can alleviate this issue but require extra information, that is usually expensive to gather. In this paper, we use a deep learning technique, Illustration2Vec, to easily extract tag information from the manga and anime posters (e.g., sword, or ponytail). We propose BALSE (Blended Alternate Least Squares with Explanation), a new model for collaborative filtering, that benefits from this extra information to recommend mangas. We show, using real data from an online manga recommender system called Mangaki, that our model improves substantially the quality of recommendations, especially for less-known manga, and is able to provide an interpretation of the taste of the users.


Hard Mixtures of Experts for Large Scale Weakly Supervised Vision

arXiv.org Machine Learning

Training convolutional networks (CNN's) that fit on a single GPU with minibatch stochastic gradient descent has become effective in practice. However, there is still no effective method for training large CNN's that do not fit in the memory of a few GPU cards, or for parallelizing CNN training. In this work we show that a simple hard mixture of experts model can be efficiently trained to good effect on large scale hashtag (multilabel) prediction tasks. Mixture of experts models are not new [7, 3], but in the past, researchers have had to devise sophisticated methods to deal with data fragmentation. We show empirically that modern weakly supervised data sets are large enough to support naive partitioning schemes where each data point is assigned to a single expert. Because the experts are independent, training them in parallel is easy, and evaluation is cheap for the size of the model. Furthermore, we show that we can use a single decoding layer for all the experts, allowing a unified feature embedding space. We demonstrate that it is feasible (and in fact relatively painless) to train far larger models than could be practically trained with standard CNN architectures, and that the extra capacity can be well used on current datasets.


Ask the GRU: Multi-Task Learning for Deep Text Recommendations

arXiv.org Machine Learning

In a variety of application domains the content to be recommended to users is associated with text. This includes research papers, movies with associated plot summaries, news articles, blog posts, etc. Recommendation approaches based on latent factor models can be extended naturally to leverage text by employing an explicit mapping from text to factors. This enables recommendations for new, unseen content, and may generalize better, since the factors for all items are produced by a compactly-parametrized model. Previous work has used topic models or averages of word embeddings for this mapping. In this paper we present a method leveraging deep recurrent neural networks to encode the text sequence into a latent vector, specifically gated recurrent units (GRUs) trained end-to-end on the collaborative filtering task. For the task of scientific paper recommendation, this yields models with significantly higher accuracy. In cold-start scenarios, we beat the previous state-of-the-art, all of which ignore word order. Performance is further improved by multi-task learning, where the text encoder network is trained for a combination of content recommendation and item metadata prediction. This regularizes the collaborative filtering model, ameliorating the problem of sparsity of the observed rating matrix.


Temporal Dynamics of User Interests in Tagging Systems

AAAI Conferences

Collaborative tagging systems are now deployed extensivelyto help users share and organize resources.Tag prediction and recommendation systems generallymodel user behavior as research has shown that accuracycan be significantly improved by modeling users’preferences. However, these preferences are usuallytreated as constant over time, neglecting the temporalfactor within users’ interests. On the other hand, littleis known about how this factor may influence predictionin social bookmarking systems. In this paper, weinvestigate the temporal dynamics of user interests intagging systems and propose a user-tag-specific temporalinterests model for tracking users’ interests overtime. Additionally, we analyze the phenomenon of topicswitches in social bookmarking systems, showing that atemporal interests model can benefit from the integrationof topic switch detection and that temporal characteristicsof social tagging systems are different fromtraditional concept drift problems. We conduct experimentson three public datasets, demonstrating the importanceof personalization and user-tag specializationin tagging systems. Experimental results show that ourmethod can outperform state-of-the-art tag predictionalgorithms. We also incorporate our model within existingcontent-based methods yielding significant improvementsin performance.